Suppose that a quantum source is known to have von
Neumann entropy less than or equal to S but is otherwise 
completely unspecified. We describe a method of universal 
quantum data compression which will faithfully compress the 
quantum information of any such source to S qubits per signal 
(in the limit of large block lengths).

The question of compressibility of information is one
of the central issues in information theory. For classical 
information Shannon's noiseless coding theorem pro- 
vides a tight bound (equal to the Shannon entropy of the 
source) on the extent to which information may be compressed.
For quantum information an analogous tight bound (equal to the
von Neumann entropy of the source) was established by Schumacher
and further developed in later work. The methods of information
compression which are generally used to establish these results
are source-specific, i.e. they apply only to each given source
separately. As elaborated below, the classical compression
protocol requires knowledge of the probability distribu- 
tion of the source and the quantum compression protocol 
requires knowledge of the density matrix of the source. In 
this letter we will consider the question of universal quan- 
tum information compression. Is there a protocol which 
will faithfully compress quantum information even if we 
do not know the density matrix of the source? More pre- 
cisely, suppose that all we know about the source is that 
its von Neumann entropy does not exceed some given 
value S. Is it then still possible to faithfully compress 
the quantum information to S qubits per signal? Re- 
markably, in the case of classical information such uni- 
versal compression schemes are known to exist. An ex- 
plicit example is a scheme based on the theory of types 
developed by Csiszar and Korner [7] (which is also described
in §12.3 of 101). In this letter we will establish the existence
of universal compression schemes for quantum information.

We begin with an outline of some source-specific com- 
pression schemes which may be used to realise the Shan- 
non and Schumacher bounds. Later our main results will 
be related to an extension of constructions occurring in 
these schemes. Consider a source of classical information
which generates signal $i$ with probability $p_i$. Note
that the signals may be faithfully represented using $\log N$
bits/signal by just using their names (here $N$ is the number
of signals; in this letter logarithms are always to base
2). Let $S = -\sum_i p_i \log p_i$ be the Shannon entropy of the
source (which is always $\le \log N$). Shannon's theorem asserts
that the signals may be represented asymptotically
faithfully using only $S$ bits/signal and that no smaller number
of bits can suffice for this task. Thus a sender (Alice)
can communicate the sequence of generated signals to a 
receiver (Bob) by sending S bits/signal and this transmis- 
sion rate is optimal. The compression may be achieved 
by the following method of block coding i.e. processing 
long sequences of signals rather than individual signals 
themselves separately. Note that we do not require that 
Bob is able to recover the signals perfectly but only that 
the probability of any error tends to zero in the limit of 
increasing block length. (This is the meaning of the term
"asymptotically faithfully".) Consider all possible signal
sequences $i_1 i_2 \ldots i_n$ of length $n$ (with associated
probability $p_{i_1} p_{i_2} \cdots p_{i_n}$). Let $\mathrm{SEQ}(n)$ be the set of all such
sequences of length $n$. Our basic ingredient is the Theorem
of Typical Sequences which asserts the following:

Theorem of Typical Sequences: For any given $\epsilon > 0$ and
$\delta > 0$ and for all sufficiently large $n$ there is a subset
$\mathrm{TYP}(n) \subset \mathrm{SEQ}(n)$ which has size $2^{n(S+\delta)}$ (i.e. an
exponentially small fraction of $\mathrm{SEQ}(n)$) but whose total
probability exceeds $1 - \epsilon$ (i.e. is as high as desired).
The sequences in $\mathrm{TYP}(n)$ are called typical sequences
and those not in $\mathrm{TYP}(n)$ are called atypical sequences.

Intuitively this theorem asserts that (for all sufficiently
large n) any sequence of signals generated by the source 
may be assumed with arbitrarily high probability, to be 
a typical sequence. Thus to achieve compression to S 
bits/signal Alice and Bob set up a list of names of all 
the typical sequences (requiring $n(S + \delta)$ bits per typical
sequence). Then for sequences of length $n$ generated by
the source Alice sends the name of the sequence if it is a 
typical sequence and the name of some fixed chosen typ- 
ical sequence if it is atypical. In the latter case Bob will 
be unable to regenerate the correct message and an error 
will have occurred. However according to the theorem 
of typical sequences, this can be arranged to occur with 
arbitrarily small probability by choosing n large enough. 
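
The counting behind this scheme can be checked numerically. The following is a minimal sketch, assuming a binary source that emits 1 with probability p, and taking the typical set to be the sequences whose per-signal surprisal lies within δ of S (one common way to make typicality precise). The function name and parameters are illustrative and do not come from the original.

```python
from math import comb, log2


def typical_set_stats(p, n, delta):
    """Return (S, size, prob) for the delta-typical set of a binary source.

    Assumption: a sequence is called typical when
    |-(1/n) log2 P(sequence) - S| <= delta, with 0 < p < 1.
    """
    S = -p * log2(p) - (1 - p) * log2(1 - p)  # Shannon entropy in bits
    size = 0      # number of typical sequences
    prob = 0.0    # total probability of the typical set
    for k in range(n + 1):  # k = number of 1s in the sequence
        log_prob = k * log2(p) + (n - k) * log2(1 - p)
        if abs(-log_prob / n - S) <= delta:
            count = comb(n, k)  # sequences with exactly k ones
            size += count
            # Work in the log domain to avoid overflow for large n.
            prob += 2.0 ** (log2(count) + log_prob)
    return S, size, prob


S, size, prob = typical_set_stats(p=0.1, n=1000, delta=0.1)
print(f"S = {S:.3f} bits, log2|TYP| = {log2(size):.1f}, P(TYP) = {prob:.4f}")
```

Since every typical sequence has probability at least $2^{-n(S+\delta)}$ under this definition, the set can never contain more than $2^{n(S+\delta)}$ sequences, so naming a typical sequence needs at most $n(S+\delta)$ bits.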

The compression of quantum information was first con- 
sidered by Schumacher who developed a quantum ana- 
logue of Shannon's theorem. The quantum compression 
protocol was subsequently simplified by Schumacher and
Jozsa (hereafter referred to as the SJ protocol) and
later Barnum et al. showed that the limit of compression
provided by the SJ protocol is optimal, i.e. that
no other conceivable compression protocol can provide 
further asymptotically faithful compression. 

Consider a source of quantum states which produces 
pure states $|\psi_i\rangle \in \mathcal{H}$ with probabilities $p_i$. Let
$\rho = \sum_i p_i |\psi_i\rangle\langle\psi_i|$ be the density matrix of the source and let
$S(\rho) = -\mathrm{tr}\,\rho \log \rho$ be its von Neumann entropy. Then the
SJ protocol provides asymptotically faithful compression
to $S(\rho)$ qubits per signal state. The method rests
again on the theorem of typical sequences above. Note
that the density matrix of all signal sequences of length
$n$ is just $\rho^{\otimes n} = \rho \otimes \cdots \otimes \rho$. Let $\lambda_i$ denote the
eigenvalues of $\rho$ so that the eigenvalues of $\rho^{\otimes n}$ are given by
all products of the form $\lambda_{i_1 \ldots i_n} = \lambda_{i_1} \cdots \lambda_{i_n}$. Let $\Lambda(n)$
be the subspace of $\mathcal{H}^{\otimes n}$ given by the span of all eigenstates
$|\lambda_{i_1} \ldots \lambda_{i_n}\rangle$ corresponding to all typical sequences
$i_1 \ldots i_n$ of eigenvalues. $\Lambda(n)$ is called the typical subspace
(for block length $n$). Since the Shannon entropy of the
distribution $\lambda_i$ is equal to the von Neumann entropy $S(\rho)$
we see that $\dim \Lambda(n) = 2^{n(S(\rho)+\delta)}$, i.e. the typical subspace
occupies about $nS(\rho)$ qubits. Let $\Pi$ denote the
projection onto the typical subspace. Then by considering
$\rho^{\otimes n}$ in its eigenbasis and recalling the theorem of
typical sequences we easily see that

$$\mathrm{tr}\,\rho^{\otimes n}\Pi > 1 - \epsilon \qquad (1)$$

This gives the SJ compression protocol: for sufficiently
large $n$ Alice accumulates a sequence of $n$ signal states
$|\psi_{in}\rangle = |\psi_{i_1}\rangle \cdots |\psi_{i_n}\rangle$ and performs a measurement
which determines whether the joint state lies in $\Lambda(n)$ or
its orthogonal complement, i.e. the joint state is projected
into one or other of these complementary subspaces.
If the state projects to $\Lambda(n)$ Alice sends the
resulting $n(S(\rho) + \delta)$ qubits to Bob. If it projects to
the orthogonal complement (which occurs with probability
$< \epsilon$) she sends to Bob any chosen state of $\Lambda(n)$. Now,
equation (1) can be shown to imply that

$$\overline{\langle\psi_{in}|\rho_{out}|\psi_{in}\rangle} > 1 - 2\epsilon$$

where $\rho_{out}$ is the state obtained by Bob if $|\psi_{in}\rangle$ was generated
by the source (and the average denoted by the
overbar is taken over all input blocks of signals $|\psi_{in}\rangle$).
Thus, Bob receives the state $|\psi_{in}\rangle$ with arbitrarily high
fidelity and in the limit of $\delta \to 0$ only $S(\rho)$ qubits/signal
were transmitted.
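
As a concrete illustration, the typical subspace of a qubit source can be sized directly from the eigenvalues of its density matrix. The sketch below assumes a hypothetical source emitting |0> and |+> with probability 1/2 each (this ensemble is not from the original), restricts itself to real symmetric 2x2 density matrices with strictly positive eigenvalues, and uses the same surprisal-based typicality criterion as an assumption. In the eigenbasis, the trace in (1) is simply the total eigenvalue weight carried by the typical strings.

```python
from math import comb, log2, sqrt


def qubit_eigenvalues(rho):
    """Eigenvalues of a real symmetric 2x2 density matrix [[a, b], [b, c]]."""
    (a, b), (_, c) = rho
    mean = (a + c) / 2
    gap = sqrt(((a - c) / 2) ** 2 + b * b)
    return mean + gap, mean - gap


def von_neumann_entropy(eigs):
    """S(rho) = -sum lambda log2 lambda over the nonzero eigenvalues."""
    return sum(-x * log2(x) for x in eigs if x > 0)


def typical_subspace(eigs, n, delta):
    """Return (log2 of the typical-subspace dimension, its weight under rho^n).

    Eigenstates of the n-fold product are labelled by binary strings; a
    string with k copies of the smaller eigenvalue has eigenvalue
    hi^(n-k) * lo^k. Assumes both eigenvalues are strictly positive.
    """
    hi, lo = eigs
    S = von_neumann_entropy(eigs)
    dim, weight = 0, 0.0
    for k in range(n + 1):
        log_eig = (n - k) * log2(hi) + k * log2(lo)
        if abs(-log_eig / n - S) <= delta:
            count = comb(n, k)
            dim += count
            weight += 2.0 ** (log2(count) + log_eig)
    return log2(dim), weight


# Hypothetical source: |0> and |+> emitted with probability 1/2 each.
rho = [[0.75, 0.25], [0.25, 0.25]]
eigs = qubit_eigenvalues(rho)
S = von_neumann_entropy(eigs)
log_dim, weight = typical_subspace(eigs, n=1000, delta=0.1)
print(f"S(rho) = {S:.4f}, qubits/signal = {log_dim / 1000:.3f}, weight = {weight:.4f}")
```

Note that only the eigenvalues of the density matrix enter the calculation; the identity of the two signal states plays no further role once the density matrix is known.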

We now come to the issue of universal compression. 
The above compression schemes based on typical se- 
quences and the typical subspace are source-specific. For 
classical compression we need to know the probability 
distribution of the source in order to identify the typical 
sequences. For quantum compression we need to know 
the density matrix of the source to identify the typical 
subspace. Already here there is a significant difference: 
in the quantum case we need to know neither the iden- 
tity of the signal states nor their probabilities, only their 
overall density matrix. Our main result below is
more remarkable still: we do not even need to
know the density matrix to achieve faithful compression,
i.e. there exists a universal quantum compression protocol
which will asymptotically faithfully compress any
quantum source with von Neumann entropy $\le S$ to $S$
qubits per signal.

But first, to illustrate the utility of the result, consider 
the recently investigated problem of compression of quantum
information with incomplete data [10]. Namely, suppose
that the information about the source is obtained
via measurements performed over a subensemble of the
generated signal sequence. Suppose further that the set
of measured observables was too small to ensure a complete
reconstruction of the density matrix of the source.
The question was: what is the maximal possible compression
rate $R$ allowing faithful transmission in this case? It
has been pointed out in [10] that the Jaynes maximal
entropy principle [11] places a lower bound on $R$:

$$R \ge S_J \qquad (2)$$

Here $S_J$ is the maximal entropy admissible by the measured
mean values (Jaynes entropy). It has also been shown
that for any qubit source the inequality becomes an equality.
Now, applying the universal quantum compression
protocol we obtain that the equality holds in the general
case, so that the Jaynes entropy gives the optimal
compression with incomplete experimental data characterizing
the source.
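
To make the Jaynes entropy concrete, consider a hypothetical qubit experiment in which only the mean value of $\sigma_z$ has been measured (this scenario is an illustrative assumption, not an example from the original). Among all Bloch vectors with the given z-component, the entropy is largest when the other two components vanish, so the maximum-entropy state has eigenvalues $(1 \pm m)/2$. A minimal sketch:

```python
from math import log2


def binary_entropy(q):
    """h(q) = -q log2 q - (1-q) log2 (1-q), with h(0) = h(1) = 0."""
    return sum(-x * log2(x) for x in (q, 1 - q) if x > 0)


def jaynes_entropy_qubit(m):
    """Maximal von Neumann entropy of a qubit state with <sigma_z> = m.

    Among Bloch vectors (x, y, m) the entropy is largest for x = y = 0,
    so the maximum-entropy state has eigenvalues (1 + m)/2 and (1 - m)/2.
    """
    if not -1.0 <= m <= 1.0:
        raise ValueError("<sigma_z> must lie in [-1, 1]")
    return binary_entropy((1 + m) / 2)


for m in (0.0, 0.5, 0.9):
    print(f"<sigma_z> = {m}: S_J = {jaynes_entropy_qubit(m):.4f} qubits/signal")
```

In this setting a universal protocol run at rate $S_J$ is guaranteed to handle whichever compatible density matrix the source actually has, since none of them has entropy above $S_J$.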

We will now briefly outline a method of classical uni- 
versal data compression. Suppose we have a classical 
source and we know only that its Shannon entropy is 
less than some given number S (and we do not know 
its probability distribution). Then a result of Csiszar
and Korner [7] shows that there exists a set of sequences
$\mathrm{CK}(n) \subset \mathrm{SEQ}(n)$ of length $n$ (whose description depends
only on the value of $S$) which satisfies all of the
properties enjoyed by $\mathrm{TYP}(n)$ in the theorem of typical
sequences not only for some one probability distribution
with Shannon entropy $S$ but simultaneously for all distributions
with entropy $\le S$, i.e. the total probability
of $\mathrm{CK}(n)$ with respect to any such distribution exceeds
$1 - \epsilon$ and the size of $\mathrm{CK}(n)$ is $2^{n(S+\delta)}$. The explicit
construction of $\mathrm{CK}(n)$ is also described in §12.3 of [Q.
Hence if we replace $\mathrm{TYP}(n)$ by $\mathrm{CK}(n)$ in the classical
compression scheme described previously we will have a
universal compression scheme which asymptotically faithfully
compresses any source with Shannon entropy $\le S$
to $S$ bits/signal.
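
The text does not spell out how the universal set is built. The sketch below assumes a construction in the spirit of the method of types for a binary alphabet: keep every sequence whose empirical entropy is at most $S + \delta$ minus a small correction of $2\log(n+1)/n$. The threshold, function names and test sources are assumptions made for illustration, not the authors' definition.

```python
from math import comb, log2


def binary_entropy(q):
    """h(q) in bits, with h(0) = h(1) = 0."""
    return sum(-x * log2(x) for x in (q, 1 - q) if x > 0)


def universal_set_stats(S, delta, n, sources):
    """Size and coverage of a type-based universal set for a binary alphabet.

    Assumption: the set holds every length-n sequence whose empirical
    entropy h(k/n) is at most S + delta - 2*log2(n+1)/n, where k is the
    number of 1s. Returns (log2 of the set size, {p: probability of the set}).
    """
    threshold = S + delta - 2 * log2(n + 1) / n
    members = [k for k in range(n + 1) if binary_entropy(k / n) <= threshold]
    size = sum(comb(n, k) for k in members)
    coverage = {}
    for p in sources:  # each source emits 1 with probability p, 0 < p < 1
        coverage[p] = sum(
            2.0 ** (log2(comb(n, k)) + k * log2(p) + (n - k) * log2(1 - p))
            for k in members
        )
    return log2(size), coverage


log_size, coverage = universal_set_stats(S=0.5, delta=0.1, n=1000, sources=[0.05, 0.1, 0.9])
print(f"log2 of set size = {log_size:.1f} (bound {1000 * 0.6:.1f})")
for p, prob in coverage.items():
    print(f"p = {p}: h(p) = {binary_entropy(p):.3f}, P(set) = {prob:.4f}")
```

The point of the sketch is that membership is decided from the sequence alone (via its empirical entropy), so a single set serves every source below the entropy threshold, including sources as different as p = 0.1 and p = 0.9.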

Consider next the prospect of replacing $\mathrm{TYP}(n)$ by
$\mathrm{CK}(n)$ in the SJ protocol. It is not difficult to see
that this modified protocol will faithfully compress to
$S$ qubits/signal all those quantum sources whose density
matrices commute with $\rho$ and have von Neumann
entropy $\le S$. Thus this does not provide a fully universal
quantum compression scheme: if we consider all
possible sources with von Neumann entropy $\le S$ then
their density matrices need not commute. Below we describe
an alternative quantum compression scheme which
is fully universal.

For any given $\rho$ let $\Xi$ be the subspace of $\mathcal{H}^{\otimes n}$ in the
modified SJ protocol which is spanned by all eigenstates
of $\rho^{\otimes n}$ labelled by CK sequences, i.e. $\Xi$ is the analogue
of the typical subspace $\Lambda(n)$. Thus projection onto $\Xi$
will achieve faithful compression for all sources with von
Neumann entropy $\le S$ whose density matrices commute
with $\rho$. A set of mutually commuting density matrices
is characterised by the corresponding common eigenbasis
and this may be any chosen orthonormal basis of $\mathcal{H}$.
Thus as $\rho$ varies over all possible density matrices with
von Neumann entropy $\le S$ there will be a subspace $\Xi$
associated with each choice of orthonormal basis of $\mathcal{H}$. We
make this dependence explicit by writing $\Xi(B)$ (where
$B$ denotes an orthonormal basis of $\mathcal{H}$) and we suppress
explicit mention of the values of $n$ and $S$ on which $\Xi$ also
depends.

Now let $T$ be the smallest subspace of $\mathcal{H}^{\otimes n}$ which contains
$\Xi(B)$ for all choices of basis $B$. Then projection
into $T$ will achieve quantum compression for all sources
with von Neumann entropy $\le S$. Below we will prove
that

$$\dim T \le (n+1)^{d^2}\, 2^{n(S+\delta)} \qquad (3)$$

where $d = \dim \mathcal{H}$, $n$ is the block length and $\delta > 0$ may
be as small as desired. Thus we will achieve universal
compression to $R$ qubits/signal where $R$ is given by

$$R = \lim_{n\to\infty} \frac{\log \dim T}{n} \le \lim_{n\to\infty} \frac{\log\left[(n+1)^{d^2}\, 2^{n(S+\delta)}\right]}{n}$$

so that $R$ is at most $S + \delta$ qubits/signal. Since $\delta$ can be as
small as desired, asymptotically we have $S$ qubits/signal.
This is our universal quantum information compression
scheme.
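
The speed at which the polynomial factor in (3) becomes negligible is easy to tabulate. A minimal sketch, with illustrative parameter values (d = 2, S = 0.6, δ = 0.01 are assumptions chosen only for the demonstration):

```python
from math import log2


def universal_rate_bound(d, n, S, delta):
    """Qubits/signal implied by (3): log2 of the bound on dim T, divided by n."""
    return d * d * log2(n + 1) / n + S + delta


for n in (10**2, 10**4, 10**6):
    print(n, round(universal_rate_bound(d=2, n=n, S=0.6, delta=0.01), 5))
```

The overhead term $d^2 \log(n+1)/n$ is the entire price of universality in this bound; it depends on the dimension $d$ of the signal space but not on the source.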

To prove (3) let $B^0 = \{e_1^0, \ldots, e_d^0\}$ be any fixed chosen
orthonormal basis of $\mathcal{H}$. Then any other basis
$B = \{e_1, \ldots, e_d\}$ is obtained from $B^0$ by applying some
$d \times d$ unitary transformation $U$. Now $\Xi(B)$ is the span of
$2^{n(S+\delta)}$ states of the form $e_{i_1} \otimes \cdots \otimes e_{i_n}$ (where we choose
all CK sequences of the labels). Denote this basis by
$\mathrm{CK}(B)$. Hence $\Xi(B)$ is precisely the subspace obtained
by applying $U^{\otimes n}$ to $\Xi(B^0)$ (where $U^{\otimes n}$ is the unitary
transformation on $\mathcal{H}^{\otimes n}$ given by $U \otimes \cdots \otimes U$). Then $T$ is
the span of all $\Xi(B)$ as $B$ ranges over all bases, which in
turn equals the span of all $U^{\otimes n}\phi$ where $U$ ranges over all
$d \times d$ unitary matrices and $\phi$ ranges over $\mathrm{CK}(B^0)$. Let
$M_d$ denote the linear space of all $d \times d$ complex matrices.
Since $M_d$ contains all unitary matrices we get

$$T \subseteq \mathrm{span}\{A^{\otimes n}\phi : A \in M_d,\ \phi \in \mathrm{CK}(B^0)\} \qquad (4)$$

For any fixed $\phi$ let

$$\mathcal{H}_\phi = \mathrm{span}\{A^{\otimes n}\phi : A \in M_d\} \qquad (5)$$

We will show that

$$\dim \mathcal{H}_\phi \le (n+1)^{d^2} \qquad (6)$$

Then using (4) and the fact that $\dim \Xi(B^0) = 2^{n(S+\delta)}$
we will immediately obtain our desired result (3).
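
The step from (4) and (6) to (3) can be spelled out as follows. This is a reconstruction of the omitted counting argument, writing $\mathcal{H}_\phi$ for the span defined in (5), $\mathrm{CK}(B^0)$ for the set of product basis states labelled by CK sequences, and $M_d$ for the space of $d \times d$ complex matrices:

```latex
\begin{align*}
T &\subseteq \mathrm{span}\{A^{\otimes n}\phi : A \in M_d,\ \phi \in \mathrm{CK}(B^0)\}
   = \sum_{\phi \in \mathrm{CK}(B^0)} \mathcal{H}_\phi, \\
\dim T &\le \sum_{\phi \in \mathrm{CK}(B^0)} \dim \mathcal{H}_\phi
   \le |\mathrm{CK}(B^0)| \, (n+1)^{d^2}
   = 2^{n(S+\delta)} \, (n+1)^{d^2}.
\end{align*}
```

The first line uses the fact that a span over pairs $(A, \phi)$ is the sum of the subspaces obtained for each fixed $\phi$; the second uses that the dimension of a sum of subspaces is at most the sum of their dimensions.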

To prove (6) we use the notion of the symmetric subspace.

Definition. The symmetric subspace of a space $\mathcal{H}^{\otimes n}$ is
the space $\mathrm{SYM}(\mathcal{H})$ of the vectors which are invariant
under any permutation of the positions in the tensor
product.

The symmetric subspace has found various applications
in quantum information theory. It has been proved
that the space $\mathrm{SYM}(\mathcal{H})$ has the following properties:

(i) it is spanned by the vectors of the form $\psi^{\otimes n}$;

(ii) its dimension is equal to $\binom{n+d-1}{d-1}$ where $d = \dim \mathcal{H}$.
In fact by considering the symmetrisation of a
product basis of $\mathcal{H}^{\otimes n}$ it is easy to obtain the simpler
over-estimate $\dim \mathrm{SYM}(\mathcal{H}) \le (n+1)^d$ which will suffice
for our purposes.

An important point to note is that for fixed $d$ and varying
$n$ the size of $\mathrm{SYM}(\mathcal{H})$ grows only polynomially with
$n$ whereas the full space $\mathcal{H}^{\otimes n}$ (of dimension $d^n$) grows
exponentially. Thus $\mathrm{SYM}(\mathcal{H})$ becomes exponentially small
inside $\mathcal{H}^{\otimes n}$ as $n$ grows.
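
The polynomial-versus-exponential contrast can be seen with a few numbers. The following minimal sketch uses the standard formula for the dimension of the symmetric subspace, $\binom{n+d-1}{d-1}$, and compares it with the over-estimate $(n+1)^d$ and with the full dimension $d^n$; the helper name is illustrative.

```python
from math import comb


def sym_dim(d, n):
    """Dimension of the symmetric subspace of n copies of a d-dimensional space."""
    return comb(n + d - 1, d - 1)


d = 2
for n in (1, 10, 100):
    exact, bound, full = sym_dim(d, n), (n + 1) ** d, d ** n
    print(f"n = {n}: dim SYM = {exact}, (n+1)^d = {bound}, d^n = {full}")
```

For qubits (d = 2) the symmetric subspace has dimension n + 1, so the over-estimate is far from tight, but it is still polynomial in n, which is all the argument requires.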

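Since $M_d$ is a linear space we can consider $M_d^{\otimes n}$ and
the symmetric subspace $\mathrm{SYM}(M_d) \subset M_d^{\otimes n}$. According
to (i)

$$\mathrm{SYM}(M_d) = \mathrm{span}\{A^{\otimes n} : A \in M_d\}$$

and hence (5) gives

$$\mathcal{H}_\phi = \mathrm{span}\{B\phi : B \in \mathrm{SYM}(M_d)\}$$

Now, we can define a linear mapping $\mathcal{T}$ from the space
$\mathrm{SYM}(M_d)$ to $\mathcal{H}_\phi$ by

$$\mathrm{SYM}(M_d) \ni B \longrightarrow \mathcal{T}(B) = B\phi \in \mathcal{H}_\phi \qquad (7)$$

This mapping is onto the space $\mathcal{H}_\phi$, and since it is linear
it cannot increase dimension. Hence

$$\dim \mathcal{H}_\phi \le \dim \mathrm{SYM}(M_d)$$

Recalling that $\dim M_d = d^2$, (ii) gives that

$$\dim \mathrm{SYM}(M_d) = \binom{n+d^2-1}{d^2-1} \le (n+1)^{d^2}$$

which proves (6) and completes the proof of (3).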

Thus we have shown that for any given $S$ and sufficiently
large $n$, projection into $T(S,n)$ will provide universal
quantum data compression to $S$ qubits/signal for
all sources of pure quantum states with von Neumann
entropy $\le S$. The same method will also work faithfully
for all sources of mixed states $\rho_i$ where the von Neumann
entropy of $\rho = \sum_i p_i \rho_i$ does not exceed $S$. Indeed we may
always represent each of these mixed states as a probabilistic
mixture of pure states whose identities we have
forgotten. Also according to the results of Barnum et al.
our compression scheme is optimal: compression
beyond $S$ qubits/signal cannot be faithful for sources of
entropy equal to $S$ and hence cannot be faithful for all
sources of entropy $\le S$.

Finally we remark that our bound on $\dim T$, although
sufficient for our purposes, is not generally tight.
Indeed all we needed to show was that $\dim T$ is some polynomial
(in $n$) multiple of $\dim \Xi(B^0)$. It is interesting to
note that $\dim T$ can be calculated exactly for the case of
$S = 0$. Here we are considering all possible trivial sources
which generate repeatedly one and the same vector
$\psi$ (i.e. have von Neumann entropy zero). For such a source the
subspace $\Xi$ and the typical subspace are both just the
one dimensional $\mathrm{span}\{\psi \otimes \cdots \otimes \psi\} \subset \mathcal{H}^{\otimes n}$. Hence $T$ is
the span of all states of the form $\psi \otimes \cdots \otimes \psi$ and by (i) we
see that $T$ equals $\mathrm{SYM}(\mathcal{H})$ in this case. As noted previously,
$\mathrm{SYM}(\mathcal{H})$ becomes vanishingly small inside $\mathcal{H}^{\otimes n}$
as $n$ increases so the number of qubits per signal used for
faithful transmission tends to zero with increasing $n$.